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Method and materials for producing deletion derivatives of polypeptides 

The present invention relates to genetic engineering and especially in vitro 
transposition. The invention describes a method and materials for producing deletion 
5 derivatives of polypeptide coding nucleic acids. In particular, the invention provides 
means for efficient generation of C-terminal deletions of polypeptides by the use of a 
modified transposon with translation stop codons in all three reading frames. The 
invention further provides a kit for producing said deletion derivatives. 

10 BACKGROUND OF THE INVENTION 

Thousands of different types of protein species constitute a major molecular component 
of cellular life. These molecules are composed of amino acid chains, the sequence of 
which is encoded by the genes in the organism's DNA. The protein function can be 

15 diverse and specific functions have been evolved for different cellular demands. Native 
wild type protein molecules can obviously be studied for their function biochemically 
and genetically. The data thus obtained can be informative but very often such 
information is relatively limited. A better description of protein function can be gained 
through mutational analysis in which various types of mutations are introduced into the 

20 protein primary sequence and the mutated proteins are then analyzed for their function. 
With current recombinant DNA technology (Sambrook et al. 1989, Sambrook and 
Russell 2001), generation of mutations is relatively easy and therefore mutational 
analysis of proteins has become a standard in functional studies of proteins. 

25 In principle, three different types of mutations can be introduced into a protein sequence 
(i) substitutions, (ii) insertions, and (Hi) deletions. In a substitution mutation, a 
particular amino acid (or an amino acid stretch), in a protein is changed to another (or to 
another amino acid stretch of same length). In an insertion, an amino acid or a stretch of 
amino acids is added to the protein thus increasing the length of the amino acid chain. 

30 In a deletion mutation, an amino acid or a stretch of amino acids are eliminated from the 
protein sequence and thus the protein becomes smaller in size. 



Various mutagenesis methods are currently available for generation of different types of 
mutations. These methods are typically straightforward to use. However, in most of the 
cases the wanted mutations are generated one by one and, therefore, their construction 
is time-consuming and labor-intensive. It would be desirable if a number of mutations 
could be generated simultaneously. For certain types of insertion mutations this type of 
approach has been described (Hayes and Hallet 2000). However, an efficient method for 
simultaneous generation of substitution and deletion mutations is still lacking. 

We utilised for the present invention a bacteriophage Mu-derived in vitro transposition 
system that has recently been introduced (Haapa et al. 1999a) and shown to function 
efficiently in many types of molecular biology applications (Wei et al. 1997, Tairaret al. 
1999, Haapa et al 1999 ab, Vilen et al. 2001). Mu transposition proceeds within the 
context of protein-DNA complexes that are called DNA transposition complexes or 
transpososomes (Mizuuchi 1991, Savilahti et al. 1995). These complexes are assembled 
from a tetramer of MuA transposase protein and Mu-transposon-derived DNA-end- 
segments (i.e. transposon end sequences recognised by MuA) containing MuA binding 
sites. When the complexes are formed they can react in divalent metal ion-dependent 
manner with any target DNA and splice the Mu end segments into the target (Savilahti 
et al 1995). In the simplest case, the MuA transposase protein and a short 50 bp Mu 
right-end (R-end) fragment are the only macromolecular components required for 
transpososome assembly (Savilahti et al. 1995, Savilahti and Mizuuchi 1996). 
Analogously, when two R-end sequences are located as inverted terminal repeats in a 
longer DNA molecule, transposition complexes form by synapsing the transposon ends. 
Target DNA in Mu DNA transposition in vitro can be linear, open circular, or 
supercoiled (Haapa et al. 1999a). 

Mu transposition complex, the machinery within which the chemical steps of 
transposition take place, is initially assembled from four molecules of MuA transposase 
protein that first bind specific binding sites in the transposon ends (Figs. 5A and 5B). 
The 50 bp Mu right end DNA segment contains two of these binding sites (they are 
called Rl and R2 and each of them is 22 bp long, Savilahti et al. 1995). When two ends, 
each bound by two MuA monomers, meet, the transposition complex is formed through 
conformational changes, the nature of which are not fully understood because of a lack 
of atomic resolution structural data on Mu transpososomes. However, the assembly of 



the minimal Mu transpososome is clearly dependent upon the correct binding of MuA 
transposase to Mu ends of the donor DNA. Thus, modifications in the conserved 
nucleotide sequence of transposon ends (e.g. Rl and R2 sequences in Mu R-end) should 
potentially have a negative effect on the efficiency of the transposition since every 

5 altered nucleotide conceivably interferes with the MuA binding. It has been documented 
(Lee and Harshey 2001, Coros and Chaconas 2001) that the two last base pairs in the 
Mu transposon end can be modified without severe effect on transpososome function. 
However, no detailed analysis has been conducted for elucidation of the effects of 
modified Rl and R2 binding sites. In one example (Laurent et al. 2000) a NotI 

10 restriction site was engineered close to the transposon end that changed one base pair in 
the Rl sequence. In vivo studies indicate that within the Rl and R2 sequences mutations 
generally have negative effects on transposition efficiency (Groenen et al. 1985, 1986). 
In addition, these effects are typically additive. 



15 SUMMARY OF THE INVENTION 



In this invention we describe a general methodology for making deletion derivatives of 
polypeptides using in vitro DNA transposition system. The method of the invention can 
be used to generate a number of deletion-derivatives of polypeptide coding nucleic 
20 acids simultaneously and with ease. 

We utilised modified transposons that allowed us to generate C-terminal deletion 
derivatives of polypeptides. The methodology should be applicable to any protein, the 
encoding nucleic acid sequence (e.g. a gene) of which is cloned in a plasmid or other 
25 DNA vector. 



In one aspect, the invention features a transposon nucleic acid comprising a genetically 
\*i engineered translation stop signal in three reading frames at least partly within a 

. : . transposon end binding sequence recognised by a transposase. 

• • • • 

30 

• • • 

y^ m In various embodiments the transposon nucleic acid of the invention may contain a 

. . . . : selectable marker and/or reporter gene. In one preferable embodiment the transposon 

end binding sequence of said transposon nucleic acid is Mu end sequence recognised by 
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MuA transposase. In one particular embodiment said Mu end sequence is Mu R-end 
sequence. 

In a second aspect, the invention provides a method for producing a deletion derivative 
of a polypeptide coding nucleic acid comprising the steps of: 

(a) performing a transposition reaction in the presence of the transposon nucleic acid of 
the invention and a target nucleic acid containing a polypeptide coding nucleic acid of 
interest, (b) recovering a target nucleic acid having said transposon nucleic acid 
incorporated in said polypeptide coding nucleic acid. 

In a preferred embodiment the method of the invention further comprises a step of (c) 
expressing said polypeptide coding nucleic acid having said transposon nucleic acid 
incorporated. 



15 In a third aspect, the invention provides a kit for producing deletion derivatives of 

polypeptide coding nucleic acids. The kit comprises the transposon nucleic acid of the 
invention. 

In a fourth aspect, the invention features use of the transposon nucleic acid of the 
20 invention for producing deletion derivatives of polypeptide coding nucleic acids. 

The term "transposon", as used herein, refers to a nucleic acid segment, which is 
recognised by a transposase or an integrase enzyme and which is essential component 
of a functional nucleic acid-protein complex capable of transposition (i.e. a 
25 transpososome). 

The term "transposase" used herein refers to an enzyme, which is an essential 
component of a functional nucleic acid-protein complex capable of transposition and 
which is mediating transposition. The term "transposase" also refers to integrases from 
30 retrotransposons or of retroviral origin. 

The expression "transposition reaction" used herein refers to a reaction wherein a 
transposon inserts into a target nucleic acid. Essential components in a transposition 



reaction are a transposon and a transposase or an integrase enzyme or some other 
components needed to form a functional transposition complex. The method and 
materials of the present invention are exemplified by employing in vitro Mu 
transposition (Haapa et al. 1999ab and Savilahti et al. 1995). However, the Mu 
transposition method is not critical for this invention. Other transposition systems can 
be used as well. Examples of such systems are Tyl (Devine and Boeke, 1994, and 
International Patent Application WO 95/23875), Transposon Tn7 (Craig, 1996), Tn 10 
and IS 10 (Kleckner et al. 1996), Mariner transposase (Lampe et al., 1996), Tel (Vos et 
al., 1996, 10(6), 755-61), Tn5 (Park et al., 1992), P element (Kaufinan and Rio, 1992) 
and Tn3 (Ichikawa and Ohtsubo, 1990), bacterial insertion sequences (Ohtsubo and 
Sekine, 1996), retroviruses (Varmus and Brown 1989) and retrotransposon of yeast 
(Boeke, 1989). 

The term "transposon end sequence" used herein refers to the conserved nucleotide 
sequences at the distal ends of a transposon. The transposon end sequences are 
responsible for identifying the transposon for transposition. 

The term "transposon end binding sequence" used herein refers to the conserved 
nucleotide sequences within the transposon end sequence whereto a transposase 
specifically binds when mediating transposition. 

The term "target nucleic acid" used herein refers to a nucleic acid molecule containing a 
protein coding nucleic acid of interest. 

The term "translation stop signal" used herein refers to the genetic code, which contains 
three codon triplets (UAA, UAG, UGA) for terminating the polypeptide chain 
production during protein synthesis in a ribosome. In a DNA strand the corresponding 
stop signal triplets are TAA, TAG and TGA. 

The term "reading frame" used herein refers to any sequence of bases in DNA or RNA 
that codes for the synthesis of either a protein or a component polypeptide. The point of 
initiation of reading determines the frame, i.e. the way in which the bases will be 
grouped in triplets as required by the genetic code. 



• • • 
« 

» • • • 



• • • 



The term "genetic engineering" used herein refers to molecular manipulation involving 
the construction of artificial recombinant nucleic acid molecules. 

The term "gene" used herein refers to genomic DNA or RNA that are translated into 
5 polypeptides. 



BRIEF DESCRIPTION OF THE DRAWINGS 
10 Figure 1. 

Cat-Mu transposons: Cat-Mu containing wild type Mu ends, Cat-Mu(iVo/I) containing 
Mu ends with engineered Notl restriction site, which design is described in Laurent et 
al. 2000, and Cat-Mu(Stop x 3) containing Mu ends with engineered translation stop 
signal in three reading frames (SEQ ID NO:2). Transposon end sequences (i.e. inverted 
15 terminal repeats) are drawn as rectangles. 

Figure 2. 

Transposon end sequences of Cat-Mu transposons: Cat-Mu transposon containing wild 
type Mu ends (SEQ ID NO:3), Cat-Mu(JVbfl) containing Mu ends with engineered Notl 
20 restriction site described in Laurent et al. 2000 (SEQ ID NO:4), and Cat-Mu(Stop x 3) 
containing Mu ends with engineered translation stop signal in three reading frames 
(SEQ ID NO;l). Asteriks (*) show modified nucleotides in the Mu ends of Cat- 
Ma(Notl) and Cat-Mu(Stop x 3). 

25 Figure 3. 

Analysis of C-terminal deletion variants on DNA level. Plasmids bearing Cat-Mu(Stop x 
3) transposon insertions (samples 1-24) were digested with BamHI, and they were 
analyzed on 1,8 % agarose gels. The length of the shortest fragment of each digest 
corresponds roughly to the length of the deletion variant protein gene (0- -650 bp). M = 
30 DNA standards. 
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Figure 4. 

Analysis of C-terminal deletion variants on protein level. The sizes of the deletion 
variant proteins, as predicted by sequencing analysis, are marked below each lane as 
kilodaltons. M= molecular weight standard, C + = positive control, C '= negative control. 
5 Predicted deletion variant protein products are pointed out by arrows. 
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Figures 5A and 5B 

5 A, Mu transposition complex. 5B, the assembly of Mu transposition complex. 
10 Figure 6. 

Overall strategy for production of C-terminal deletion variants of genes encoding 
proteins. 
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DETAILED DESCRIPTION OF THE INVENTION 



It has been published previously that protein engineering applications will benefit from 
Mu-based transposon strategies since it was established that any DNA sandwiched 
between Mu ends could be utilised as artificial transposons (Haapa et al. 1999a). In, 
principle insertion mutations (e.g. by addition of epitope tags or protein domains) and 

20 deletion mutations (by addition of translation stop codons) were foreseen with this 
strategy. However, introduction of a translation stop codon between transposon ends 
would leave a number of encoded amino acid residues into the protein's C-terminus. 
Given that an effective Mu end is about 50 bp in length, minimally this strategy would 
leave approximately 18 extra amino acids attached in the protein C-terminus. Extra 

25 amino acids may interfere with the protein function, therefore it would be better to add 
the stop codons as close as possible to the transposon end. By modifying the nucleotides 
of the Mu R-end (total of 7 nucleotides were changed, 5 of said nucleotides reside in 
Mu Rl sequence), we managed to place three stop codons in three reading frames very 
close to the Mu R-end resulting in transposons that still surprisingly retained their 

30 ability to form transposition complexes that were competent for transposition chemistry, 
i.e. they facilitated the integration of the transposon in vitro into a target plasmid. In 
essence, all the possible C-terminal deletion variants can be generated. 
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We designed an artificial Cat-Mu(Stop)-transposon (SEQ ID NO:2) conferring 
resistance to chloramphenicol that in its ends contained modified base pairs providing 
three stop codons in three reading frames (figs. 1 and 2). The gene mediating resistance 
to chloramphenicol is used as a selectable marker. The term "selectable marker" refers 
to a gene that, when carried by a transposon, alters the ability of a cell harboring the 
transposon to grow or survive in a given growth enviroment relative to a similar cell 
lacking the selectable marker. The transposon nucleic acid of the invention preferably 
contains a positive selectable marker. A positive selectable marker, such as an antibiotic 
resistance, encodes a product that enables the host to grow and survive in the presence 
of an agent, which otherwise would inhibit the growth of the organism or kill it. The 
transposon nucleic acid of the invention may also contain a reporter gene, which can be 
any gene encoding a product whose expression is detectable and/or quantitatable by 
immunological, chemical, biochemical, biological or mechanical assays. A reporter 
gene product may, for example, have one of the following attributes: fluorescence (e.g., 
green fluorescent protein), enzymatic activity (e.g., luciferase, /acZ/p-galactosidase), 
toxicity (e.g., ricin) or an ability to be specifically bound by a second molecule (e.g., 
biotin). The use of markers and reporter genes in prokaryotic and eukaryotic cells is 
well-known in the art. In a preferred embodiment the transposon nucleic acid of the 
invention may also contain genetically engineered restriction enzyme sites. For 
example, the selectable marker gene within the transposon of the invention may 
influence the protein expression when a construct obtained by the method of the 
invention is inserted into a protein expression plasmid. It is therefore desirable to 
engineer a pair of unique restriction sites to flank the selectable marker gene. The 
marker can then be removed easily by the use of these sites and thus the final expression 
construct would not contain the marker gene. 

Hence, one embodiment of the invention provides a transposon nucleic acid comprising 
a genetically engineered translation stop signal in three reading frames at least partly 
within a transposon end binding sequence recognised by a transposase. Preferably, the 
transposon nucleic acid of the invention comprises Mu transposon sequence. More 
preferably the transposon nucleic acid of the invention comprises Mu R-end sequence, 
e.g., the sequence of SEQ ID NO: 1 or SEQ ID NO:5 (Mu-R end sequence not including 
5' overhang, which thus can vary). In a transposon end sequence of the transposon 
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nucleic acid of the invention, translation stop signals of three reading frames are in 
5'-to-3' direction, preferably in succession close to each other at a very end of a 
transposon, thus the three stop signals are as near as possible the flanking sequence after 
the transposon is incorporated into a target. Furthermore, the transposon end sequences, 
which participate in the assembly of the transpososome discussed above, can be 
different from each other or they can be in different nucleic acid molecules. Preferably, 
both transposon end sequences participating in the transpososome have similar 
sequences (i.e. they are located as inverted terminal repeats). Although the transposon 
nucleic acid of the invention is exemplified here by transposons of Mu system only, a 
person skilled in the art understands that teachings of this invention can be utilised in 
other transposon systems as well. 

Another embodiment of the invention is a method for producing a deletion derivative of 
a polypeptide coding nucleic acid comprising the steps of: 
(a) performing a transposition reaction in the presence of a target nucleic acid 
containing a polypeptide coding nucleic acid (e.g. a gene) of interest and in the presence 
of a transposon containing a genetically engineered translation stop signal sequence in 
three reading frames at least partly within a transposon end binding sequence 
recognised by a transposase, (b) recovering a target nucleic acid having said transposon 
incorporated in said gene. 

The transposition reaction (a) includes a transposon in a form of linear DNA molecule, 
transposase protein (e.g. MuA), and a target DNA as macromolecular components. 
Additionally, the transposition reaction contains suitable buffer components including 
Mg2+ ions critical for chemical catalysis. Buffer components such as glycerol and 
DMSO (or related chemicals or solvents) somewhat relax the requirements for 
transposition reaction (Savilahti et al. 1995). Transposon DNA, in principle, can be of 
any length given that it in each end contain a Mu end sequence. Typically, target DNA is 
in a form of circular plasmid. However, any double-stranded DNA molecule more than 
25 bp is expected to serve as efficient target molecule (Savilahti et al. 1995, Haapa- 
Paananen et al. 2002). In transposition reaction the reaction components are incubated 
together; during the incubation transposition complexes first form and then react with 
target DNA splicing the transposon DNA into target DNA. This process yields 
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transposon integrations into target molecules. The stoichiometry of the reaction (excess 
target) generates target molecules each with a single integrated transposon. Most 
importantly, the integration site in each molecule can be different. Even though some 
sites in DNA are somewhat more preferred than others most of the phosphodiester 
5 bonds in DNA will be targeted (Haapa et al. 1999ab, Haapa-Paananen et al.2002). In 
practice this means that the integration sites are selected essentially randomly. 

In the Examples below deletion mutant libraries were planned to cover the gene of 
interest at least 10-fold, i.e. when the target gene was approximately 600 bp, the final 
10 pool should contain of a minimum of 6000 mutants. As a test protein we utilised 23 kDa 
yeast Msol protein (Aalto et al. 1997). Those skilled in the art can easily design 
different strategies for mutant library construction as such strategies are well-known in 
the art (see, e.g., Sambrook et al. 1989, Sambrook and Russell 2001). 

15 A mutant library was produced as described in Example 2. Target nucleic acids with a 
transposon insertion were isolated by size-selective preparative agarose gel 
electrophoresis. A person skilled in the art may design different isolation methods as 
such methods are well-known in the art (see, for example, Current Protocols in 
Molecular Biology, eds. Ausubel et al, John Wiley & Sons: 1992). We screened 

20 individual deletion mutants by restriction analysis (fig. 3). This analysis demonstrates 
that in the library, there are variants of different sizes. A person skilled in the art can 
easily utilise different screening techniques. The screening step can be performed, e.g., 
by methods involving sequence analysis, nucleic acid hybridisation, primer extension or 
antibody binding. These methods are well-known in the art (see, for example, Current 

25 Protocols in Molecular Biology, eds. Ausubel et al, John Wiley & Sons: 1992). 

We sequenced 23 C-terminal mutants. All the mutants carried the translation stop 
codons in three reading frames. 

30 Finally, the protein expression analysis (fig. 4) demonstrated that different deletion 
variant proteins are produced. Probably due to lack of resolution in the utilised gel 
system, the supposedly expressed protein was not detectable when the deletion 
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derivative was 8 kDa or smaller. Alternatively, very small versions of the Msol protein 
may be proteolytically degraded inside the cells. 

A further embodiment of the invention is a kit providing means for producing deletion 
derivatives of protein coding nuclear acid sequences. The kit comprises the transposon 
nucleic acid of the invention. The kit can be packaged in a suitable container and 
preferably it contains instructions for using the kit. 

The results of the invention show that, unexpectedly, it is possible to substantially 
modify conserved transposase binding sequences of the Mu transposon ends without 
critically compromising the competence of the modified transposon to assemble 
transposition complexes and thereafter carry out transposition chemistry. Thus, the 
invention provides a straightforward solution to the problem of extra amino acids 
attached in the protein C-terminus of the deletion derivative which could be produced 
by a conventional transposition system, wherein the transposon used contains the 
translation stop signals between the transposon ends. 

The present invention is further described in the following examples, which are not 
intended to limit the scope of the invention. 

EXAMPLES 

EXAMPLE 1 

In vitro transposition reaction 

In vitro transposition reaction (25 \xl) contained 720 ng cat-Mu(Stop) transposon as a 
donor, 500 ng plasmid pHis6-MS01 as a target nucleic acid, 0.2 |ig MuA, 25 mM Tris- 
HC1 atpH 8.0, 100 fig/ml BSA, 15% (w/v) glycerol, 0.05% (w/v) Triton X-100, 126 
mM NaCl and 10 mM MgCl 2 . The reaction was carried out at 30°C for 4h. 

Further details and variables of in vitro Mu transposition are described in Haapa et al. 
1999ab and Savilahti et al. 1995, incorporated herein by reference. 
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EXAMPLE 2 

5 Generation of a pool of mutants with C-terminal deletions in Msol 

In vitro transposition reactions with Stop-Mu were performed essentially as described in 
Haapa et al. (1999a) with the exception that they contained 720 ng donor DNA (Stop- 
Mu x 3) and 0,88 fig MuA. Ten reactions were pooled, phenol and chlorophorm 
extracted, ethanol precipitated, and resuspended in 30 ^il of water. Several 1 |al aliquots 

10 were electrotransformed, each into 25 \xl of DH5a electrocompetent cells, as described 
(Haapa et al 1999a). Transposon-containing plasmid clones were selected on LB plates 
containing Ap and Cm. A total of ~6x 10 5 colonies were pooled and grown in selective 
LB-Ap-Cm medium at 37 °C for 3h after which plasmid DNA was prepared from the 
pool with Qiagen Plasmid Midi kit. This plasmid preparation was subjected to a Xhol- 

15 HindlR double digestion and preparative agarose gel electrophoresis. The DNA 
fragment corresponding to transposon insertions into the M?o/-containing DNA 
fragment was isolated with QIAquick Gel Extraction Kit (Qiagen). This fragment was 
then ligated into the plasmid pHis6-MS01 vector XhoI-HindUL backbone to generate a 
construct pool with transposon insertions located only within the Msol gene. After 

20 ligation, a pool of plasmids from ~5x 10 4 colonies was prepared as described above. 
Approximately 110 000 colonies were pooled. Transposon-carrying Msol fragments 
were cloned into clean vector backbone as described above and approximately 1 1 000 
colonies were pooled in the final C-terminal deletion mutant library. At all stages, the 
transformants were selected with Ap and Cm. 

25 

EXAMPLE 3 

Restriction and expression analysis of deletion mutants 

Mutant clones were analyzed for deletions by BarriHL digestion and DNA sequencing. 
30 For protein expression analysis, single mutant plasmids were introduced into 

BL21(DE3) expression strain. Selective medium was inoculated with o/n culture of 
bacteria containing mutant plasmid and grown until ODeoo was 0,4-0,7. Protein 
expression was induced with 1 mM IPTG for 3 hours and samples were withdrawn for 
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SDS-PAGE analysis. Bacterial lysates were run on 15 % gels and stained with GelCode 
blue stain (Pierce) as recommended by the supplier. 

MATERIALS AND METHODS 

5 

Bacteria, media, enzymes and reagents 

Bacterial cultures were grown in Luria broth supplemented with appropriate antibiotics: 

ampicillin (Ap) at 100 jag/ml and chloramphenicol (Cm) at 10 fig/ml when required. 

Escherichia coli strains were DH5a (Life Technologies) and BL21(DE3) (Novagen). 
10 MuA protein was purified in collaboration with Finnzymes (Espoo, Finland) essentially 

as described (Baker et ah 1993, Haapa et ah 1999a). Restriction enzymes and T4 DNA 

ligase were from New England Biolabs and Promega, Triton X-100 from Roche. 

Standard DNA techniques were performed as described (Sambrook and Russell 2001). 

Enzymes were used as. recommended by suppliers. Sequencing was carried out at the 
15 sequencing service unit of the Institute of Biotechnology, University of Helsinki. 



Plasmids and transposons 

Plasmid pHis6-MS01 contains the 633 bp Msol gene as an insert (Aalto et ah 1997). 
The Cat-Mu(Stop) transposon (1254 bp) is a derivative of the Cat-Mu transposon 
20 (Haapa et ah 1999a), and they encode resistance to chloramphenicol (fig. 1 and 2). The 
Cat-Mu(Stop)-transposon ends were engineered to carry translation stop signals for 
both 5'-to-3 ' directions of dsDNA in all three reading frames. Sequencing was 
performed using the primer 5 '-get agt tat tgc tea gcg g-3 ' (SEQ ID NO:5). 
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We claim: 

1. A transposon nucleic acid comprising a genetically engineered translation stop signal 
in three reading frames at least partly within a transposon end binding sequence 
recognised by a transposase. 

2. The transposon nucleic acid according to claim 1, wherein said transposon contains a 
selectable marker and/or a reporter gene. 

3. The transposon nucleic acid according to claim 1 or 2, wherein said transposon end 
binding sequence is Mu end binding sequence. 

4. The transposon nucleic acid according to claim 3, wherein said Mu end binding 
sequence is Mu R-end binding sequence. 

5. The transposon nucleic acid according to claim 4, wherein said transposon sequence 
is set forth in SEQ ID NO:l, SEQ ID NO:2 or SEQ ID NO:5. 

6. The transposon nucleic acid according to any one of the preceding claim, wherein 
said transposon further contains a genetically engineered restriction enzyme site. 

7. Method of producing a deletion derivative of a polypeptide coding nucleic acid 
comprising the steps of: 

(a) performing a transposition reaction in the presence of a target nucleic acid 
containing a polypeptide coding nucleic acid of interest and in the presence of a 
transposon containing a genetically engineered translation stop signal sequence in three 
reading frames at least partly within a transposon end binding sequence recognised by a 
transposase, (b) recovering a target nucleic acid having said transposon incorporated in 
said protein coding nucleic acid. 

8. The method according to claim 7 further comprising a step of (c) expressing said 
protein coding nucleic acid having said transposon incorporated. 




9. The method according to claim 7 or 8, wherein said transposon comprises the 
transposon nucleic acid of any one of claims 2-6. 

10. A kit for producing deletion derivatives of polypeptide coding nucleic acids 
comprising the transposon nucleic acid of any one of claims 1-6. 



11. Use of the transposon nucleic acid of any one of claims 1-6 for producing deletion 
derivatives of polypeptide coding nucleic acids. 
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The present invention describes an in vitro transposition-based methodology for 
generation of deletion derivatives of polypeptides. A modified artificial transposon was 
designed that, close to the transposon ends, contained a modification with translation 
stop codons in three reading frames. Transposition complexes can be assembled using 
the modified transposon and essentially random integrations into the target plasmid, 
containing a polypeptide coding nucleic acid of interest, were recovered as a plasmid 
pool. Subsequent manipulation steps including restriction enzyme digestions and 
ligation resulted in pools of mutant clones from which deletion derivatives of a 
polypeptide coding nucleic acid of interest and its respective deletion polypeptides 
could be produced. 
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SEQUENCE LISTING 

<110> Finnzymes Oy 

<120> Method and materials for producing deletion derivatives 
of polypeptides 



<140> 
<141> 

<160> 6 

<170> Patent In Ver. 2.1 

<210> 1 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified Mu 
end sequence 

<400> 1 

gatctgattg attgaacgaa aaacgcgaaa gcgtttcacg ataaatgcga aaac 54 



<210> 2 
<211> 1254 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified Mu 
transposon 

<400> 2 

gatctgattg attgaacgaa aaacgcgaaa gcgtttcacg ataaatgcga aaacggatcc 60 

tatcgtcaat tattacctcc acggggagag cctgagcaaa ctggcctcag gcatttgaga 120 

agcacacggt cacactgctt ccggtagtca ataaaccggt aaaccagcaa tagacataag 180 

cggctattta acgaccctgc cctgaaccga cgaccgggtc gaatttgctt tcgaatttct 240 

gccattcatc cgcttattat cacttattca ggcgtagcaa ccaggcgttt aagggcacca 300 

ataactgcct taaaaaaatt acgccccgcc ctgccactca tcgcagtact gttgtaattc 360 

attaagcatt ctgccgacat ggaagccatc acaaacggca tgatgaacct gaatcgccag 420 

cggcatcagc accttgtcgc cttgcgtata atatttgccc atggtgaaaa cgggggcgaa 480 

gaagttgtcc atattggcca cgtttaaatc aaaactggtg aaactcaccc agggattggc 540 

tgagacgaaa aacatattct caataaaccc tttagggaaa taggccaggt tttcaccgta 600 

acacgccaca tcttgcgaat atatgtgtag aaactgccgg aaatcgtcgt ggtattcact 660 

ccagagcgat gaaaacgttt cagtttgctc atggaaaacg gtgtaacaag ggtgaacact 720 

atcccatatc accagctcac cgtctttcat tgccatacgt aattccggat gagcattcat 780 

caggcgggca agaatgtgaa taaaggccgg ataaaacttg tgcttatttt tctttacggt 840 

ctttaaaaag gccgtaatat ccagctgaac ggtctggtta taggtacatt gagcaactga 900 

ctgaaatgcc tcaaaatgtt ctttacgatg ccattgggat atatcaacgg tggtatatcc 960 

agtgattttt ttctccattt tagcttcctt agctcctgaa aatctcgaca actcaaaaaa 1020 

tacgcccggt agtgatctta tttcattatg gtgaaagttg gaacctctta cgtgccgatc 1080 

aacgtctcat tttcgccaaa agttggccca gggcttcccg gtatcaacag ggacaccagg 1140 

atttatttat tctgcgaagt gatcttccgt cacaggtatt tattcggtcg aaaaggatcc 1200 

gttttcgcat ttatcgtgaa acgctttcgc gtttttcgtt caatcaatca gate 1254 



<210> 3 
<211> 54 
<212> DNA 

<213> Bacteriophage Mu 
<400> 3 

gatctgaagc ggcgcacgaa aaacgcgaaa gcgtttcacg ataaatgcga aaac 

<210> 4 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified Mu 
end sequence 

<400> 4 

gatctgcggc cgcgcacgaa aaacgcgaaa gcgtttcacg ataaatgcga aaac 

<210> 5 
<211> 50 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Modified Mu 
end sequence without 5 • overhang 

<400> 5 

tgattgattg aacgaaaaac gcgaaagcgt ttcacgataa atgcgaaaac 

<210> 6 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Sequencing 
primer 

<400> 6 

gctagttatt gctcagcgg 
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